WHAT IS CLAIMED IS:

1. A method for associating a gene G in the genome of a species with a
clinical trait T exhibited by one or more organisms in a plurality of organisms of said
species, the method comprising:

(A) identifying an expression quantitative trait loci (eQTL) for said gene G using
a first quantitative trait loci (QTL) analysis, wherein said first QTL analysis uses a
plurality of expression statistics for gene G as a quantitative trait, wherein each
expression statistic in said plurality of expression statistics represents an expression value
for gene G in an organism in said plurality of organisms;

(B) identifying a clinical quantitative trait loci (cQTL) that is linked to said
clinical trait T using a second QTL analysis, wherein said second QTL analysis uses a
plurality of phenotypic values as a quantitative trait, wherein each phenotypic value in
said plurality of phenotypic values is a phenotypic value for said clinical trait T in an
organism in said plurality of organisms; and

(C) determining whether said eQTL and said cQTL colocalize to the same locus in
the genome of said species, wherein, when said eQTL and said cQTL colocalize to the
same locus, said gene G is associated with said clinical trait T.



2. The method of claim 1, wherein said determining step further comprises
determining whether the locus of said eQTL in the genome of said species corresponds to
the physical location of said gene G in the genome of said species, wherein, when said
locus of said eQTL in the genome of said species corresponds to the physical location of
said gene G in the genome of said species, the association between said gene G and said
clinical trait T is confirmed.

3. The method of claim 2, wherein said eQTL corresponds to the physical
location of said gene G when the eQTL and gene G colocalize within 3 cM of each other
in the genome of said species.

4. The method of claim 2, wherein said eQTL corresponds to the physical
location of said gene G when the eQTL and gene G colocalize within 1 cM of each other
in the genome of said species.

5. The method of claim 1, wherein said eQTL and said cQTL are not
colocalized unless a test for pleiotropy indicates that said eQTL and said cQTL are
represented by a QTL that is common to both said eQTL and said cQTL.

6. The method of claim 1, wherein said first QTL analysis and said second 
QTL analysis each uses a genetic map. 

7. The method of claim 6, which further comprises, prior to the first 
identifying step, a step of constructing said genetic map from a set of genetic markers 
associated with said plurality of organisms. 

8. The method of claim 7, wherein said set of genetic markers comprises 
single nucleotide polymorphisms (SNPs), microsatellite markers, restriction fragment 
length polymorphisms, short tandem repeats, DNA methylation markers, sequence length 
polymorphisms, random amplified polymorphic DNA, amplified fragment length 
polymorphisms, or simple sequence repeats. 

9. The method of claim 7, wherein genotype data is used in said constructing 
step and wherein said genotype data comprises knowledge of which alleles, for each 
marker in said set of genetic markers, are present in each organism in said plurality of 
organisms. 

10. The method of claim 7, wherein said plurality of organisms represents a 
segregating population and pedigree data is used in said constructing step, and wherein 
said pedigree data shows one or more relationships between organisms in said plurality of 
organisms. 



11. The method of claim 10, wherein said plurality of organisms comprises an
F2 population, an F1 population, an F2:3 population, or a Design III population and said one
or more relationships between organisms in said plurality of organisms indicates which
organisms in said plurality of organisms are members of said F2 population, said F1
population, said F2:3 population, or said Design III population.

12. The method of claim 1, wherein each said expression value is a normalized
expression level measurement for said gene G in an organism in said plurality of
organisms.



13. The method of claim 12, wherein each said expression level measurement
is determined by measuring an amount of a cellular constituent that corresponds to said
gene G in one or more cells from an organism in said plurality of organisms.

14. The method of claim 13, wherein said amount of said cellular constituent
comprises an abundance of an RNA present in said one or more cells of said organism, an
abundance of a protein in said one or more cells of said organism, an abundance of an
mRNA expressing said gene, or a degree of protein modification.

15. The method of claim 14, wherein said abundance of said RNA is measured
by a method comprising contacting a gene transcript array with said RNA from said one
or more cells of said organism, or with nucleic acid derived from said RNA, wherein said
gene transcript array comprises a positionally addressable surface with attached nucleic
acids or nucleic acid mimics, wherein said nucleic acids or nucleic acid mimics are
capable of hybridizing with said RNA species, or with nucleic acid derived from said
RNA species.

16. The method of claim 12, wherein said normalized expression level
measurement is obtained by a normalization technique selected from the group consisting
of Z-score of intensity, median intensity, log median intensity, Z-score standard deviation
log of intensity, Z-score mean absolute deviation of log intensity, calibration DNA gene
set, user normalization gene set, ratio median intensity correction, and intensity
background correction.
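
By way of illustration only, two of the normalization techniques named in claim 16 might be sketched as follows. The claims do not define these techniques precisely, so the definitions here are assumptions: the function names are hypothetical, a base-10 logarithm is assumed, and the "standard deviation" variant is assumed to use the sample standard deviation across organisms for a single gene.

```python
import math
import statistics


def zscore_sd_log_intensity(intensities):
    """Z-score of log10 intensity, scaled by the sample standard deviation.

    `intensities` holds one positive raw intensity per organism for one gene.
    (Assumed definition; the claims name the technique without defining it.)
    """
    logs = [math.log10(x) for x in intensities]
    mean = statistics.fmean(logs)
    sd = statistics.stdev(logs)
    return [(v - mean) / sd for v in logs]


def zscore_mad_log_intensity(intensities):
    """Z-score of log10 intensity, scaled by the mean absolute deviation."""
    logs = [math.log10(x) for x in intensities]
    mean = statistics.fmean(logs)
    mad = statistics.fmean(abs(v - mean) for v in logs)
    return [(v - mean) / mad for v in logs]
```

Either function returns one normalized expression value per organism, which could then serve as the plurality of expression statistics used as the quantitative trait in the first QTL analysis.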



17. The method of claim 1, wherein said first QTL analysis comprises:

(i) testing for linkage between (a) the genotype of said plurality of organisms at a
position in the genome of said species and (b) said plurality of expression statistics for
gene G;

(ii) advancing the position in said genome by an amount; and

(iii) repeating steps (i) and (ii) until all or a portion of the genome of said species
has been tested.

18. The method of claim 17, wherein said amount is less than 100
centiMorgans.

19. The method of claim 17, wherein said amount is less than 10
centiMorgans.

20. The method of claim 17, wherein said amount is less than 5 centiMorgans.

21. The method of claim 17, wherein said amount is less than 2.5
centiMorgans.
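
As a non-limiting illustration, steps (i) through (iii) of claim 17 with a fixed step size might be sketched as below. This is a minimal sketch under stated assumptions, not the claimed analysis itself: the names `lod_score`, `genome_scan`, and `genotype_at` are hypothetical, genotypes are assumed to be coded additively as 0, 1, or 2, and the linkage test is assumed to be a simple regression of the trait on the genotype code, with the score computed as (n/2) log10(RSS0/RSS1).

```python
import math


def lod_score(genotypes, trait):
    """LOD score from a simple linear regression of trait on genotype code."""
    n = len(trait)
    mean_g = sum(genotypes) / n
    mean_y = sum(trait) / n
    sxx = sum((g - mean_g) ** 2 for g in genotypes)
    rss0 = sum((y - mean_y) ** 2 for y in trait)  # null model: no QTL effect
    if sxx == 0 or rss0 == 0:
        return 0.0  # no genotype or trait variation, so nothing to test
    sxy = sum((g - mean_g) * (y - mean_y) for g, y in zip(genotypes, trait))
    rss1 = rss0 - sxy * sxy / sxx  # model with an additive QTL effect
    if rss1 <= 0:
        return math.inf  # perfect fit
    return (n / 2) * math.log10(rss0 / rss1)


def genome_scan(genotype_at, trait, length_cm, step_cm):
    """Test for linkage at positions 0, step_cm, 2*step_cm, ... up to length_cm.

    `genotype_at(position_cm)` is an assumed callable that returns one genotype
    code per organism at that position, for example from the nearest marker.
    """
    n_steps = int(length_cm // step_cm)
    return [
        (i * step_cm, lod_score(genotype_at(i * step_cm), trait))
        for i in range(n_steps + 1)
    ]
```

The same loop would apply to the second QTL analysis by passing the phenotypic values for clinical trait T as `trait`.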

22. The method of claim 17, wherein said testing comprises performing 
linkage analysis or association analysis. 

23. The method of claim 22, wherein said linkage analysis or association
analysis generates a statistical score for said position in the genome of said species.

24. The method of claim 23, wherein said testing is linkage analysis and said 
statistical score is a logarithm of the odds (lod) score. 

25. The method of claim 24, wherein said eQTL is represented by a lod score 
that is greater than 2.0. 

26. The method of claim 24, wherein said eQTL is represented by a lod score
that is greater than 3.0.

27. The method of claim 24, wherein said eQTL is represented by a lod score
that is greater than 4.0.

28. The method of claim 24, wherein said eQTL is represented by a lod score
that is greater than 5.0.
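
For illustration, a lod score threshold such as those recited in claims 25 to 28 might be applied to the output of a genome scan as sketched below. The function name `qtl_peaks` and the rule of reporting local maxima are assumptions made for this sketch; the claims only require that the QTL be represented by a lod score greater than the stated value.

```python
def qtl_peaks(scan, threshold=3.0):
    """Return (position_cm, lod) pairs that are local maxima above threshold.

    `scan` is a list of (position_cm, lod) pairs ordered by position.
    Adjacent positions with equal scores are each reported.
    """
    peaks = []
    for i, (pos, lod) in enumerate(scan):
        left = scan[i - 1][1] if i > 0 else float("-inf")
        right = scan[i + 1][1] if i < len(scan) - 1 else float("-inf")
        if lod > threshold and lod >= left and lod >= right:
            peaks.append((pos, lod))
    return peaks
```

Raising `threshold` from 2.0 toward 5.0 trades sensitivity for a lower rate of spurious QTL.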

29. The method of claim 1, wherein said second QTL analysis comprises:

(i) testing for linkage between (a) the genotype of said plurality of organisms at a
position in the genome of said species and (b) said plurality of phenotypic values;

(ii) advancing the position in said genome by an amount; and

(iii) repeating steps (i) and (ii) until all or a portion of the genome of said species
has been tested.



30. The method of claim 29, wherein said amount is less than 100
centiMorgans.

31. The method of claim 29, wherein said amount is less than 10
centiMorgans.

32. The method of claim 29, wherein said amount is less than 5 centiMorgans.

33. The method of claim 29, wherein said amount is less than 2.5
centiMorgans.

34. The method of claim 29, wherein said testing comprises performing 
linkage analysis or association analysis. 

35. The method of claim 34, wherein said linkage analysis or association 
analysis generates a statistical score for said position in the genome of said species. 

36. The method of claim 35, wherein said testing is linkage analysis and said 
statistical score is a logarithm of the odds (lod) score. 

37. The method of claim 36, wherein said cQTL is represented by a lod score 
that is greater than 2.0. 

38. The method of claim 36, wherein said cQTL is represented by a lod score 
that is greater than 3.0. 

39. The method of claim 36, wherein said cQTL is represented by a lod score 
that is greater than 4.0. 



40. The method of claim 36, wherein said cQTL is represented by a lod score
that is greater than 5.0.

41. The method of claim 1, wherein said plurality of organisms is human.

42. The method of claim 1, wherein said clinical trait T is a complex trait. 

43. The method of claim 42, wherein said complex trait is characterized by an 
allele that exhibits incomplete penetrance in said species. 

44. The method of claim 42, wherein said complex trait is a disease that is 
contracted by an organism in said population, and wherein said organism inherits no 
predisposing allele to said disease. 

45. The method of claim 42, wherein said complex trait arises when any of a 
plurality of different genes in the genome of said species is mutated. 

46. The method of claim 42, wherein said complex trait requires the 
simultaneous presence of mutations in a plurality of genes in the genome of said species. 

47. The method of claim 42, wherein said complex trait is associated with a
high frequency of disease-causing alleles in said species. 

48. The method of claim 42, wherein said complex trait is a phenotype that 
does not exhibit Mendelian recessive or dominant inheritance attributable to a single gene 
locus. 



49. The method of claim 42, wherein said complex trait is asthma, ataxia 
telangiectasia, bipolar disorder, cancer, common late-onset Alzheimer's disease, diabetes, 
heart disease, hereditary early-onset Alzheimer's disease, hereditary nonpolyposis colon 
cancer, hypertension, infection, maturity-onset diabetes of the young, mellitus, migraine, 
nonalcoholic fatty liver, nonalcoholic steatohepatitis, non-insulin-dependent diabetes 
mellitus, obesity, polycystic kidney disease, psoriases, schizophrenia, or xeroderma 
pigmentosum.

50. The method of claim 1, wherein said eQTL and said cQTL colocalize to
the same locus in the genome of said species when the physical location of the eQTL in
said genome is within 40 cM of the physical location of the cQTL in said genome.

51. The method of claim 1, wherein said eQTL and said cQTL colocalize to
the same locus in the genome of said species when the physical location of the eQTL in
said genome is within 20 cM of the physical location of the cQTL in said genome.

52. The method of claim 1, wherein said eQTL and said cQTL colocalize to
the same locus in the genome of said species when the physical location of the eQTL in
said genome is within 10 cM of the physical location of the cQTL in said genome.

53. The method of claim 1, wherein said eQTL and said cQTL colocalize to
the same locus in the genome of said species when the physical location of the eQTL in
said genome is within 6 cM of the physical location of the cQTL in said genome.
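
The distance-based colocalization tests of claims 50 to 53 might be sketched as follows, again purely as an illustration. The `Locus` type and the function name `colocalizes` are hypothetical, loci on different chromosomes are assumed never to colocalize, and "within" is assumed to include the boundary distance.

```python
from typing import NamedTuple


class Locus(NamedTuple):
    chromosome: str
    position_cm: float  # map position in centiMorgans


def colocalizes(eqtl: Locus, cqtl: Locus, max_distance_cm: float = 10.0) -> bool:
    """True when both loci lie on one chromosome within max_distance_cm."""
    return (
        eqtl.chromosome == cqtl.chromosome
        and abs(eqtl.position_cm - cqtl.position_cm) <= max_distance_cm
    )
```

Calling the same function with the mapped position of gene G in place of the cQTL and a limit of 3 cM or 1 cM would correspond to the location test of claims 3 and 4.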



54. A computer program product for use in conjunction with a computer
system, the computer program product comprising a computer readable storage medium
and a computer program mechanism embedded therein, the computer program mechanism
for associating a gene G in the genome of a species with a clinical trait T exhibited by one
or more organisms in a plurality of organisms of said species, the computer program
mechanism comprising:

an expression quantitative trait loci (eQTL) identification module for identifying
an expression quantitative trait loci (eQTL) for said gene G using a first quantitative trait
loci (QTL) analysis, wherein said first QTL analysis uses a plurality of expression
statistics for gene G as a quantitative trait, wherein each expression statistic in said
plurality of expression statistics represents an expression value for gene G in an organism
in said plurality of organisms;

a clinical quantitative trait loci (cQTL) identification module for identifying a
clinical quantitative trait loci (cQTL) that is linked to said clinical trait T using a second
QTL analysis, wherein said second QTL analysis uses a plurality of phenotypic values as
a quantitative trait, wherein each phenotypic value in said plurality of phenotypic values
represents a phenotypic value for said clinical trait T in an organism in said plurality of
organisms; and

a determination module for determining whether said eQTL and said cQTL
colocalize to the same locus in the genome of said species, wherein, when said eQTL and
said cQTL colocalize to the same locus, said gene G is associated with said clinical trait
T.

55. The computer program product of claim 54, wherein said determining
further comprises determining whether the locus of said eQTL in the genome of said
species corresponds to the physical location of said gene G in the genome of said species,
wherein, when said locus of said eQTL in the genome of said species corresponds to the
physical location of said gene G in the genome of said species, said gene G is associated
with said clinical trait T.

56. The computer program product of claim 55, wherein said eQTL
corresponds to the physical location of said gene G when the eQTL and gene G colocalize
within 3 cM of each other in the genome of said species.

57. The computer program product of claim 55, wherein said eQTL
corresponds to the physical location of said gene G when the eQTL and gene G colocalize
within 1 cM of each other in the genome of said species.

58. The computer program product of claim 54, wherein said eQTL and said 
cQTL are not considered colocalized unless a test for pleiotropy indicates that said eQTL 
and said cQTL are represented by a QTL that is common to both said eQTL and said 
cQTL. 

59. The computer program product of claim 54, wherein said first QTL
analysis and said second QTL analysis each uses a genetic map.

60. The computer program product of claim 59, the computer program
mechanism further comprising a genetic map construction module for constructing said
genetic map from a set of genetic markers associated with said plurality of organisms.

61. The computer program product of claim 60, wherein said set of genetic
markers comprises single nucleotide polymorphisms (SNPs), microsatellite markers,
restriction fragment length polymorphisms, short tandem repeats, DNA methylation
markers, sequence length polymorphisms, random amplified polymorphic DNA,
amplified fragment length polymorphisms, or simple sequence repeats.

62. The computer program product of claim 60, wherein genotype data is used
by said genetic map construction module and wherein said genotype data comprises
knowledge of which alleles, for each marker in said set of genetic markers, are present in
each organism in said plurality of organisms.

63. The computer program product of claim 60, wherein said plurality of
organisms represents a segregating population and pedigree data is used in said
constructing step, and wherein said pedigree data shows one or more relationships
between organisms in said plurality of organisms.

64. The computer program product of claim 63, wherein said plurality of
organisms comprises an F2 population, an F1 population, an F2:3 population, or a Design III
population and said one or more relationships between organisms in said plurality of
organisms indicates which organisms in said plurality of organisms are members of said
F2 population, said F1 population, said F2:3 population, or said Design III population.

65. The computer program product of claim 54, wherein each said expression
value is a normalized expression level measurement for said gene G in an organism in
said plurality of organisms.

66. The computer program product of claim 65, wherein each said expression
level measurement is determined by measuring an amount of a cellular constituent that
corresponds to said gene G in one or more cells from an organism in said plurality of
organisms.

67. The computer program product of claim 66, wherein said amount of said
cellular constituent comprises an abundance of an RNA present in said one or more cells
of said organism, an abundance of a protein in said one or more cells of said organism, an
abundance of an mRNA expressing said gene, or a degree of protein modification.

68. The computer program product of claim 67, wherein said abundance of
said RNA is measured by a method comprising contacting a gene transcript array with
said RNA from said one or more cells of said organism, or with nucleic acid derived from
said RNA, wherein said gene transcript array comprises a positionally addressable surface
with attached nucleic acids or nucleic acid mimics, wherein said nucleic acids or nucleic
acid mimics are capable of hybridizing with said RNA species, or with nucleic acid
derived from said RNA species.



69. The computer program product of claim 68, wherein said normalized
expression level measurement is obtained by a normalization technique selected from the
group consisting of Z-score of intensity, median intensity, log median intensity, Z-score
standard deviation log of intensity, Z-score mean absolute deviation of log intensity,
calibration DNA gene set, user normalization gene set, ratio median intensity correction,
and intensity background correction.

70. The computer program product of claim 54, wherein said first QTL
analysis comprises:

(i) testing for linkage between (a) the genotype of said plurality of organisms at a
position in the genome of said species and (b) said plurality of expression statistics for
gene G;

(ii) advancing the position in said genome by an amount; and

(iii) repeating steps (i) and (ii) until a portion of the genome of said species has
been tested.

71. The computer program product of claim 70, wherein said amount is less
than 100 centiMorgans.

72. The computer program product of claim 70, wherein said amount is less
than 10 centiMorgans.

73. The computer program product of claim 70, wherein said amount is less
than 5 centiMorgans.

74. The computer program product of claim 70, wherein said amount is less
than 2.5 centiMorgans.

75. The computer program product of claim 70, wherein said testing comprises
performing linkage analysis or association analysis.

76. The computer program product of claim 75, wherein said linkage analysis
or association analysis generates a statistical score for said position in the genome of said
species.

77. The computer program product of claim 76, wherein said testing is linkage 
analysis and said statistical score is a logarithm of the odds (lod) score. 

78. The computer program product of claim 77, wherein said eQTL is 
represented by a lod score that is greater than 2.0. 

79. The computer program product of claim 77, wherein said eQTL is
represented by a lod score that is greater than 3.0.

80. The computer program product of claim 77, wherein said eQTL is
represented by a lod score that is greater than 4.0.

81. The computer program product of claim 77, wherein said eQTL is
represented by a lod score that is greater than 5.0.

82. The computer program product of claim 54, wherein said second QTL
analysis comprises:

(i) testing for linkage between (a) the genotype of said plurality of organisms at a
position in the genome of said species and (b) said plurality of phenotypic values;

(ii) advancing the position in said genome by an amount; and

(iii) repeating steps (i) and (ii) until a portion of the genome of said species has
been tested.

83. The computer program product of claim 82, wherein said amount is less 
than 100 centiMorgans. 

84. The computer program product of claim 82, wherein said amount is less
than 10 centiMorgans.

85. The computer program product of claim 82, wherein said amount is less
than 5 centiMorgans.

86. The computer program product of claim 82, wherein said amount is less
than 2.5 centiMorgans.

87. The computer program product of claim 82, wherein said testing comprises 
performing linkage analysis or association analysis. 

88. The computer program product of claim 87, wherein said linkage analysis 
or association analysis generates a statistical score for said position in the genome of said 
species. 

89. The computer program product of claim 88, wherein said testing is linkage
analysis and said statistical score is a logarithm of the odds (lod) score.

90. The computer program product of claim 89, wherein said cQTL is 
represented by a lod score that is greater than 2.0. 

91. The computer program product of claim 89, wherein said cQTL is
represented by a lod score that is greater than 3.0.

92. The computer program product of claim 89, wherein said cQTL is
represented by a lod score that is greater than 4.0.

93. The computer program product of claim 89, wherein said cQTL is 
represented by a lod score that is greater than 5.0. 

94. The computer program product of claim 54, wherein said plurality of
organisms is human.

95. The computer program product of claim 54, wherein said clinical trait T is 
a complex trait. 

96. The computer program product of claim 95, wherein said complex trait is
characterized by an allele that exhibits incomplete penetrance in said species.

97. The computer program product of claim 95, wherein said complex trait is a
disease that is contracted by an organism in said population, and wherein said organism
inherits no predisposing allele to said disease.

98. The computer program product of claim 95, wherein said complex trait 
arises when any of a plurality of different genes in the genome of said species is mutated. 



99. The computer program product of claim 95, wherein said complex trait 
requires the simultaneous presence of mutations in a plurality of genes in the genome of 
said species. 



100. The computer program product of claim 95, wherein said complex trait is
associated with a high frequency of disease-causing alleles in said species.

101. The computer program product of claim 95, wherein said complex trait is a
phenotype that does not exhibit Mendelian recessive or dominant inheritance attributable
to a single gene locus.

102. The computer program product of claim 95, wherein said complex trait is
asthma, ataxia telangiectasia, bipolar disorder, cancer, common late-onset Alzheimer's
disease, diabetes, heart disease, hereditary early-onset Alzheimer's disease, hereditary
nonpolyposis colon cancer, hypertension, infection, maturity-onset diabetes of the young,
mellitus, migraine, nonalcoholic fatty liver, nonalcoholic steatohepatitis,
non-insulin-dependent diabetes mellitus, obesity, polycystic kidney disease, psoriases,
schizophrenia, or xeroderma pigmentosum.

103. The computer program product of claim 54, wherein said eQTL and said
cQTL colocalize to the same locus in the genome of said species when the physical
location of the eQTL in said genome is within 40 cM of the physical location of the cQTL
in said genome.

104. The computer program product of claim 54, wherein said eQTL and said
cQTL colocalize to the same locus in the genome of said species when the physical
location of the eQTL in said genome is within 20 cM of the physical location of the cQTL
in said genome.

105. The computer program product of claim 54, wherein said eQTL and said
cQTL colocalize to the same locus in the genome of said species when the physical
location of the eQTL in said genome is within 10 cM of the physical location of the cQTL
in said genome.

106. The computer program product of claim 54, wherein said eQTL and said 
cQTL colocalize to the same locus in the genome of said species when the physical 
location of the eQTL in said genome is within 6 cM of the physical location of the cQTL 
in said genome. 

107. A computer system for associating a gene G in the genome of a species
with a clinical trait T exhibited by one or more organisms in a plurality of organisms of
said species, the computer system comprising:

a central processing unit;

a memory, coupled to the central processing unit, the memory storing an
expression quantitative trait loci (eQTL) identification module, a clinical quantitative trait
loci (cQTL) identification module, and a determination module; wherein

the expression quantitative trait loci (eQTL) identification module comprises
instructions for identifying an expression quantitative trait loci (eQTL) for said gene G
using a first quantitative trait loci (QTL) analysis, wherein said first QTL analysis uses a
plurality of expression statistics for gene G as a quantitative trait, wherein each
expression statistic in said plurality of expression statistics represents an expression value
for gene G in an organism in said plurality of organisms;

the clinical quantitative trait loci (cQTL) identification module comprises
instructions for identifying a clinical quantitative trait loci (cQTL) that is linked to said
clinical trait T using a second QTL analysis, wherein said second QTL analysis uses a
plurality of phenotypic values as a quantitative trait, wherein each phenotypic value in
said plurality of phenotypic values represents a phenotypic value for said clinical trait T
in an organism in said plurality of organisms; and

the determination module comprises instructions for determining whether said
eQTL and said cQTL colocalize to the same locus in the genome of said species, wherein,
when said eQTL and said cQTL colocalize to the same locus, said gene G is associated
with said clinical trait T.



108. The computer system of claim 107, wherein said instructions for
determining further comprise determining whether the locus of said eQTL in the genome
of said species corresponds to the physical location of said gene G in the genome of said
species, wherein, when said locus of said eQTL in the genome of said species corresponds
to the physical location of said gene G in the genome of said species, said gene G is
associated with said clinical trait T.

109. The computer system of claim 108, wherein said eQTL corresponds to the
physical location of said gene G when the eQTL and gene G colocalize within 3 cM of
each other in the genome of said species.

110. The computer system of claim 108, wherein said eQTL corresponds to the
physical location of said gene G when the eQTL and gene G colocalize within 1 cM of
each other in the genome of said species.

111. The computer system of claim 107, wherein said eQTL and said cQTL are
not considered colocalized unless a test for pleiotropy indicates that said eQTL and said
cQTL are represented by a QTL that is common to both said eQTL and said cQTL.

112. The computer system of claim 107, wherein said first QTL analysis and
said second QTL analysis each uses a genetic map that represents the genome of said
species.

113. The computer system of claim 112, the memory further storing a genetic
map construction module, the genetic map construction module including instructions for
constructing said genetic map from a set of genetic markers associated with said plurality
of organisms.

114. The computer system of claim 113, wherein said set of genetic markers
comprises single nucleotide polymorphisms (SNPs), microsatellite markers, restriction
fragment length polymorphisms, short tandem repeats, DNA methylation markers,
sequence length polymorphisms, random amplified polymorphic DNA, amplified
fragment length polymorphisms, or simple sequence repeats.

115. The computer system of claim 113, wherein genotype data is used by said 
genetic map construction module, wherein said genotype data comprises knowledge of 
which alleles, for each marker in said set of genetic markers, are present in each organism 
in said plurality of organisms. 

116. The computer system of claim 113, wherein said plurality of organisms 
represents a segregating population and pedigree data is used in said constructing step, 
and wherein said pedigree data shows one or more relationships between organisms in 
said plurality of organisms. 

117. The computer system of claim 113, wherein said plurality of organisms
comprises an F2 population, an F1 population, an F2:3 population, or a Design III population
and said one or more relationships between organisms in said plurality of organisms
indicates which organisms in said plurality of organisms are members of said F2
population, said F1 population, said F2:3 population, or said Design III population.

118. The computer system of claim 107, wherein each said expression value is a 
normalized expression level measurement for said gene G in an organism in said plurality 
of organisms. 

119. The computer system of claim 118, wherein each said expression level
measurement is determined by measuring an amount of a cellular constituent that
corresponds to said gene G in one or more cells from an organism in said plurality of
organisms.

120. The computer system of claim 119, wherein said amount of said cellular
constituent comprises an abundance of an RNA present in said one or more cells of said
organism, an abundance of a protein in said one or more cells of said organism, an
abundance of an mRNA expressing said gene, or a degree of protein modification.

121. The computer system of claim 120, wherein said abundance of said RNA
is measured by a method comprising contacting a gene transcript array with said RNA
from said one or more cells of said organism, or with nucleic acid derived from said
RNA, wherein said gene transcript array comprises a positionally addressable surface
with attached nucleic acids or nucleic acid mimics, wherein said nucleic acids or nucleic
acid mimics are capable of hybridizing with said RNA species, or with nucleic acid
derived from said RNA species.

122. The computer system of claim 121, wherein said normalized expression
level measurement is obtained by a normalization technique selected from the group
consisting of Z-score of intensity, median intensity, log median intensity, Z-score
standard deviation log of intensity, Z-score mean absolute deviation of log intensity,
calibration DNA gene set, user normalization gene set, ratio median intensity correction,
and intensity background correction.

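Illustrative note: the first technique named in claim 122, "Z-score of intensity", can be sketched as below. This is a minimal sketch under assumptions that are not in the claims: the function name zscore_normalize is hypothetical, the raw intensities are made-up values, and the population standard deviation is used (the claims do not say which estimator applies).

```python
from statistics import mean, pstdev


def zscore_normalize(intensities):
    """Return (x - mean) / standard deviation for each raw intensity.

    Assumption: population standard deviation; the claims do not fix this.
    """
    mu = mean(intensities)
    sigma = pstdev(intensities)
    if sigma == 0:
        raise ValueError("intensities are constant; Z-score is undefined")
    return [(x - mu) / sigma for x in intensities]


# Hypothetical raw intensities for one gene across eight organisms.
raw = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
normalized = zscore_normalize(raw)
```

After this transform the values have mean zero and unit standard deviation, so expression levels from different genes can be used as quantitative traits on a common scale.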

123. The computer system of claim 107, wherein said first QTL analysis 
comprises: 

(i) testing for linkage between (a) the genotype of said plurality of organisms at a
position in the genome of said species and (b) said plurality of expression statistics for
gene G;

(ii) advancing the position in said genome by an amount; and 

(iii) repeating steps (i) and (ii) until a portion of the genome of said species has 
been tested. 

124. The computer system of claim 123, wherein said amount is less than 100
centiMorgans.

125. The computer system of claim 123, wherein said amount is less than 10
centiMorgans. 

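Illustrative note: steps (i) to (iii) of claim 123, with the step size of claims 124 to 126, amount to a loop over genome positions. The sketch below assumes a single linkage group measured in centiMorgans and takes the linkage test as a caller-supplied function; genome_scan, toy_test, and all numbers are hypothetical and not taken from the claims.

```python
def genome_scan(test_linkage, start_cm, end_cm, step_cm):
    """Apply test_linkage at evenly spaced positions from start_cm to end_cm.

    test_linkage: callable mapping a position (cM) to a statistical score.
    Returns a list of (position, score) pairs.
    """
    if step_cm <= 0:
        raise ValueError("step_cm must be positive")
    # Small epsilon guards against floating-point error in the division.
    n_steps = int((end_cm - start_cm) / step_cm + 1e-9)
    scores = []
    for k in range(n_steps + 1):
        position = start_cm + k * step_cm                   # step (ii): advance
        scores.append((position, test_linkage(position)))   # step (i): test
    return scores                                           # step (iii): repeated


def toy_test(position):
    """Toy stand-in for a real linkage test: score peaks at 42 cM."""
    return max(0.0, 5.0 - abs(position - 42.0) / 4.0)


scores = genome_scan(toy_test, start_cm=0.0, end_cm=100.0, step_cm=2.0)
peak = max(scores, key=lambda pair: pair[1])
```

A smaller step gives finer resolution of the peak at the cost of more tests, which is the trade-off the dependent claims on step size describe.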

126. The computer system of claim 123, wherein said amount is less than 5 
centiMorgans.

128. The computer system of claim 123, wherein said testing comprises
performing linkage analysis or association analysis. 

129. The computer system of claim 128, wherein said linkage analysis or
association analysis generates a statistical score for said position in the genome of said
species. 

130. The computer system of claim 129, wherein said testing is linkage analysis
and said statistical score is a logarithm of the odds (lod) score.

131. The computer system of claim 130, wherein said eQTL is represented by a 
lod score that is greater than 2.0. 

132. The computer system of claim 130, wherein said eQTL is represented by a
lod score that is greater than 3.0.

133. The computer system of claim 130, wherein said eQTL is represented by a 
lod score that is greater than 4.0.

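Illustrative note: the claims do not say how the lod score is computed. One common choice, assumed here, is single-marker regression under a normal model, where lod = (n/2) * log10(RSS0 / RSS1), with RSS0 the residual sum of squares ignoring genotype and RSS1 the residual sum of squares within genotype groups. The function name, genotype labels, and trait values below are hypothetical.

```python
import math
from collections import defaultdict


def lod_score(genotypes, trait_values):
    """Single-marker lod score: (n / 2) * log10(RSS0 / RSS1)."""
    n = len(trait_values)
    grand_mean = sum(trait_values) / n
    # RSS0: spread of the trait when genotype is ignored (null model).
    rss0 = sum((y - grand_mean) ** 2 for y in trait_values)

    groups = defaultdict(list)
    for genotype, y in zip(genotypes, trait_values):
        groups[genotype].append(y)

    # RSS1: spread remaining after fitting one mean per genotype group.
    rss1 = 0.0
    for values in groups.values():
        group_mean = sum(values) / len(values)
        rss1 += sum((y - group_mean) ** 2 for y in values)

    if rss1 == 0:
        raise ValueError("genotype explains the trait exactly; lod is unbounded")
    return (n / 2) * math.log10(rss0 / rss1)


genotypes = ["AA", "AA", "AB", "AB", "BB", "BB"]
linked = lod_score(genotypes, [1.0, 1.2, 2.0, 2.2, 3.0, 3.2])
unlinked = lod_score(genotypes, [1.0, 3.0, 1.0, 3.0, 1.0, 3.0])
exceeds_threshold = linked > 3.0
```

In the first toy data set the trait tracks genotype closely and the score clears the 3.0 threshold; in the second the group means are identical and the score is zero.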

134. The computer system of claim 130, wherein said eQTL is represented by a
lod score that is greater than 5.0. 

135. The computer system of claim 107, wherein said second QTL analysis 
comprises:

(i) testing for linkage between (a) the genotype of said plurality of organisms at a 
position in the genome of said species and (b) said plurality of phenotypic values; 

(ii) advancing the position in said genome by an amount; and 

(iii) repeating steps (i) and (ii) until a portion of the genome of said species has 
been tested.

136. The computer system of claim 135, wherein said amount is less than 100 
centiMorgans.

137. The computer system of claim 135, wherein said amount is less than 10
centiMorgans. 

138. The computer system of claim 135, wherein said amount is less than 5 
centiMorgans.

139. The computer system of claim 135, wherein said amount is less than 2.5 
centiMorgans. 

140. The computer system of claim 135, wherein said testing comprises
performing linkage analysis or association analysis.

141. The computer system of claim 140, wherein said linkage analysis or
association analysis generates a statistical score for said position in the genome of said
species.

142. The computer system of claim 141, wherein said testing is linkage analysis 
and said statistical score is a logarithm of the odds (lod) score. 

143. The computer system of claim 142, wherein said cQTL is represented by a
lod score that is greater than 2.0.

144. The computer system of claim 142, wherein said cQTL is represented by a 
lod score that is greater than 3.0. 

145. The computer system of claim 142, wherein said cQTL is represented by a 
lod score that is greater than 4.0. 

146. The computer system of claim 142, wherein said cQTL is represented by a 
lod score that is greater than 5.0.

147. The computer system of claim 107, wherein said plurality of organisms is
human.

148. The computer system of claim 107, wherein said clinical trait T is a
complex trait. 

149. The computer system of claim 148, wherein said complex trait is 
characterized by an allele that exhibits incomplete penetrance in said species. 

150. The computer system of claim 148, wherein said complex trait is a disease 
that is contracted by an organism in said population, and wherein said organism inherits 
no predisposing allele to said disease. 

151. The computer system of claim 148, wherein said complex trait arises when 
any of a plurality of different genes in the genome of said species is mutated. 

152. The computer system of claim 148, wherein said complex trait requires the
simultaneous presence of mutations in a plurality of genes in the genome of said species. 

153. The computer system of claim 148, wherein said complex trait is 
associated with a high frequency of disease-causing alleles in said species. 

154. The computer system of claim 148, wherein said complex trait is a 
phenotype that does not exhibit Mendelian recessive or dominant inheritance attributable 
to a single gene locus. 

155. The computer system of claim 148, wherein said complex trait is asthma, 
ataxia telangiectasia, bipolar disorder, cancer, common late-onset Alzheimer's disease, 
diabetes, heart disease, hereditary early-onset Alzheimer's disease, hereditary 
nonpolyposis colon cancer, hypertension, infection, maturity-onset diabetes of the young, 
mellitus, migraine, nonalcoholic fatty liver, nonalcoholic steatohepatitis, non-insulin- 
dependent diabetes mellitus, obesity, polycystic kidney disease, psoriases, schizophrenia, 
or xeroderma pigmentosum. 

156. The computer system of claim 107, wherein said eQTL and said cQTL
colocalize to the same locus in the genome of said species when the physical location of
the eQTL in said genome is within 40 cM of the physical location of the cQTL in said
genome.

157. The computer system of claim 107, wherein said eQTL and said cQTL
colocalize to the same locus in the genome of said species when the physical location of 
the eQTL in said genome is within 20 cM of the physical location of the cQTL in said 
genome. 

158. The computer system of claim 107, wherein said eQTL and said cQTL 
colocalize to the same locus in the genome of said species when the physical location of 
the eQTL in said genome is within 10 cM of the physical location of the cQTL in said 
genome. 

159. The computer system of claim 107, wherein said eQTL and said cQTL 
colocalize to the same locus in the genome of said species when the physical location of 
the eQTL in said genome is within 6 cM of the physical location of the cQTL in said 
genome. 

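Illustrative note: the colocalization test of claims 156 to 159 reduces to a distance comparison on a map. The sketch below assumes each QTL is summarized by a chromosome label and a single position in cM, and that "within" is inclusive; the QTL type, the colocalize function, and the example positions are hypothetical.

```python
from typing import NamedTuple


class QTL(NamedTuple):
    chromosome: str
    position_cm: float


def colocalize(eqtl, cqtl, max_distance_cm):
    """True when both QTL lie on the same chromosome within max_distance_cm."""
    if eqtl.chromosome != cqtl.chromosome:
        return False
    # Assumption: the boundary value itself counts as "within".
    return abs(eqtl.position_cm - cqtl.position_cm) <= max_distance_cm


# Hypothetical eQTL and cQTL that lie 8.5 cM apart on chromosome 4.
eqtl = QTL("4", 31.0)
cqtl = QTL("4", 39.5)

# Thresholds recited in claims 156, 157, 158, and 159 respectively.
results = {t: colocalize(eqtl, cqtl, t) for t in (40.0, 20.0, 10.0, 6.0)}
```

The same pair of loci satisfies the 40, 20, and 10 cM thresholds but not the 6 cM threshold, which shows how the narrower dependent claims restrict the association.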

160. A method for determining the topology of a biological pathway that affects
a complex trait, the method comprising: 

(A) identifying one or more expression quantitative trait loci (eQTL) for a gene in 
a plurality of genes using a first quantitative trait loci (QTL) analysis, wherein said first
QTL analysis uses a plurality of expression statistics for the gene as a quantitative trait,
wherein each expression statistic in said plurality of expression statistics represents an 
expression value for the gene in an organism in a plurality of organisms of a species; 

(B) repeating step (A) a first number of times, wherein each repetition of step (A) 
uses a different gene in said plurality of genes; 

(C) identifying a clinical quantitative trait loci (cQTL) that is linked to a clinical
trait in a plurality of clinical traits using a second QTL analysis, wherein said second QTL
analysis uses a plurality of phenotypic values as a quantitative trait, wherein each 
phenotypic value in said plurality of phenotypic values is a phenotypic value for the 
clinical trait in the plurality of clinical traits in an organism in said plurality of organisms; 

(D) repeating step (C) a second number of times, wherein each repetition of step
(C) uses a different clinical trait in a plurality of clinical traits; and

(E) using (i) an identity of each eQTL, identified in an iteration of step (A), that
colocalizes with a cQTL, identified in an iteration of step (C), and (ii) a physical location
of each gene in said plurality of genes on a molecular map for said species, in order to
determine the topology of said biological pathway that affects said complex trait.

161. The method of claim 160, wherein three or more clinical traits are analyzed
in respective iterations of step (C). 

162. The method of claim 160, wherein eight or more clinical traits are analyzed
in respective iterations of step (C).

163. The method of claim 160, wherein three or more genes are analyzed in 
respective iterations of step (A). 

164. The method of claim 160, wherein eight or more genes are analyzed in
respective iterations of step (A). 

165. The method of claim 160, wherein said molecular map is a marker map
and an eQTL is colocalized with a cQTL when the eQTL and the cQTL fall within 1 cM
of each other on said marker map.

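Illustrative note: the claims do not specify how step (E) of claim 160 turns its two inputs into a pathway topology. The sketch below only tabulates those inputs, under stated assumptions: an eQTL is linked to a trait when it lies within 1 cM of one of the trait's cQTL (claim 165), and it is labeled "cis" when it lies within 3 cM of the gene's own map position (the window of claim 3) and "trans" otherwise. All names and data are hypothetical.

```python
from typing import NamedTuple


class Locus(NamedTuple):
    chromosome: str
    position_cm: float


def near(a, b, window_cm):
    """True when two loci share a chromosome and lie within window_cm."""
    return a.chromosome == b.chromosome and abs(a.position_cm - b.position_cm) <= window_cm


def tabulate_links(eqtls, cqtls, gene_locations, coloc_cm=1.0, cis_cm=3.0):
    """List (gene, trait, eQTL, kind) for every eQTL that colocalizes with a cQTL."""
    links = []
    for gene, gene_eqtls in eqtls.items():
        for eqtl in gene_eqtls:
            for trait, trait_cqtls in cqtls.items():
                if any(near(eqtl, cqtl, coloc_cm) for cqtl in trait_cqtls):
                    is_cis = near(eqtl, gene_locations[gene], cis_cm)
                    links.append((gene, trait, eqtl, "cis" if is_cis else "trans"))
    return links


# Hypothetical results of steps (A) to (D) and a hypothetical marker map.
eqtls = {"geneA": [Locus("2", 50.0)], "geneB": [Locus("2", 50.5), Locus("7", 10.0)]}
cqtls = {"fat_mass": [Locus("2", 50.4)]}
gene_locations = {"geneA": Locus("2", 51.0), "geneB": Locus("11", 30.0)}

links = tabulate_links(eqtls, cqtls, gene_locations)
summary = [(gene, trait, kind) for gene, trait, _, kind in links]
```

In this toy example geneA sits at the shared locus and geneB is regulated from it at a distance, which is the kind of distinction a topology built from these inputs could draw on.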


166. The method of claim 160, wherein said molecular map is a genomic map
and an eQTL is colocalized with a cQTL when they fall within 500 bases of each other on
said genomic map.

167. The method of claim 160, wherein said first QTL analysis and said second
QTL analysis each uses a genetic map that represents the genome of said species. 

168. The method of claim 160, wherein said genetic map is constructed from
single nucleotide polymorphisms (SNPs), microsatellite markers, restriction fragment
length polymorphisms, short tandem repeats, DNA methylation markers, sequence length 
polymorphisms, random amplified polymorphic DNA, amplified fragment length 
polymorphisms, or simple sequence repeats. 

169. The method of claim 160, wherein each said expression value is a 
normalized expression level measurement for the gene in an organism in said plurality of 
organisms.

170. The method of claim 169, wherein each said expression level measurement
is determined by measuring an amount of a cellular constituent that corresponds to the 
gene in one or more cells from an organism in said plurality of organisms. 

171. The method of claim 170, wherein said amount of said cellular constituent 
comprises an abundance of an RNA present in said one or more cells of said organism, an
abundance of a protein in said one or more cells of said organism, an abundance of an 
mRNA expressing said gene, or a degree of protein modification. 

172. The method of claim 170, wherein said abundance of said RNA is
measured by a method comprising contacting a gene transcript array with said RNA from 
said one or more cells of said organism, or with nucleic acid derived from said RNA, 
wherein said gene transcript array comprises a positionally addressable surface with 
attached nucleic acids or nucleic acid mimics, wherein said nucleic acids or nucleic acid 
mimics are capable of hybridizing with said RNA species, or with nucleic acid derived 
from said RNA species. 

173. The method of claim 169, wherein said normalized expression level
measurement is obtained by a normalization technique selected from the group consisting 
of Z-score of intensity, median intensity, log median intensity, Z-score standard deviation 
log of intensity, Z-score mean absolute deviation of log intensity, calibration DNA gene 
set, user normalization gene set, ratio median intensity correction, and intensity 
background correction. 

174. The method of claim 160, wherein said first QTL analysis comprises: 

(i) testing for linkage between (a) a genotype of said plurality of organisms at a 
position in a genome of said species and (b) said plurality of expression statistics for the 
gene; 

(ii) advancing the position in said genome by an amount; and 

(iii) repeating steps (i) and (ii) until a portion of the genome of said species has 
been tested. 

175. The method of claim 174, wherein said testing comprises performing 
linkage analysis or association analysis.

176. The method of claim 175, wherein said linkage analysis or association
analysis generates a statistical score for said position in the genome of said species. 

177. The method of claim 176, wherein said testing is linkage analysis and said 
statistical score is a logarithm of the odds (lod) score.

178. The method of claim 177, wherein said eQTL is represented by a lod score
that is greater than 3.0. 

179. The method of claim 160, wherein said second QTL analysis comprises:

(i) testing for linkage between (a) the genotype of said plurality of organisms at a 
position in a genome of said species and (b) said plurality of phenotypic values; 

(ii) advancing the position in said genome by an amount; and 

(iii) repeating steps (i) and (ii) until a portion of the genome of said species has been
tested.

180. The method of claim 179, wherein said testing comprises performing
linkage analysis or association analysis. 

181. The method of claim 180, wherein said linkage analysis or association
analysis generates a statistical score for said position in the genome of said species.

182. The method of claim 181, wherein said testing is linkage analysis and said 
statistical score is a logarithm of the odds (lod) score. 

183. The method of claim 182, wherein said cQTL is represented by a lod score
that is greater than 3.0. 

184. The method of claim 160, wherein said species is human. 

185. The method of claim 160, wherein said complex trait is characterized by an 
allele that exhibits incomplete penetrance in said species.

186. The method of claim 160, wherein said complex trait is a disease that is
contracted by an organism in said population, and wherein said organism inherits no 
predisposing allele to said disease. 

187. The method of claim 160, wherein said complex trait arises when any of a
plurality of different genes in the genome of said species is mutated.

188. The method of claim 160, wherein said complex trait requires the 
simultaneous presence of mutations in a plurality of genes in the genome of said species. 

189. The method of claim 160, wherein said complex trait is associated with a 
high frequency of disease-causing alleles in said species. 

190. The method of claim 160, wherein said complex trait is a phenotype that 
does not exhibit Mendelian recessive or dominant inheritance attributable to a single gene
locus.

191. The method of claim 160, wherein said complex trait is asthma, ataxia
telangiectasia, bipolar disorder, cancer, common late-onset Alzheimer's disease, diabetes,
heart disease, hereditary early-onset Alzheimer's disease, hereditary nonpolyposis colon
cancer, hypertension, infection, maturity-onset diabetes of the young, mellitus, migraine, 
nonalcoholic fatty liver, nonalcoholic steatohepatitis, non-insulin-dependent diabetes 
mellitus, obesity, polycystic kidney disease, psoriases, schizophrenia, or xeroderma 
pigmentosum. 

192. A computer program product for use in conjunction with a computer 
system, the computer program product comprising a computer readable storage medium 
and a computer program mechanism embedded therein, the computer program mechanism 
for determining the topology of a biological pathway that affects a complex trait, the
computer program mechanism comprising:

an expression quantitative trait loci (eQTL) identification module, the eQTL 
identification module comprising: 

instructions for identifying one or more expression quantitative trait loci (eQTL) 
for a gene in a plurality of genes using a first quantitative trait loci (QTL) analysis, 
wherein said first QTL analysis uses a plurality of expression statistics for the gene as a
quantitative trait, wherein each expression statistic in said plurality of expression statistics
represents an expression value for the gene in an organism in a plurality of organisms of a 
species; and 

instructions for repeating a first number of times said instructions for identifying 
one or more eQTL for a gene, wherein each repetition of said instructions for identifying
one or more eQTL for a gene uses a different gene in said plurality of genes; 

a clinical quantitative trait loci (cQTL) identification module; the cQTL
identification module comprising:

instructions for identifying a clinical quantitative trait loci (cQTL) that is linked to
a clinical trait in a plurality of clinical traits using a second QTL analysis, wherein said
second QTL analysis uses a plurality of phenotypic values as a quantitative trait, wherein
each phenotypic value in said plurality of phenotypic values represents a phenotypic value
for the clinical trait in the plurality of clinical traits in an organism in said plurality of
organisms; and

instructions for repeating a second number of times said instructions for
identifying one or more cQTL that is linked to a clinical trait, wherein each repetition of
said instructions for identifying one or more cQTL that is linked to a clinical trait uses a
different clinical trait in a plurality of clinical traits; and

a determination module, said determination module comprising:

instructions for using (i) the identity of each eQTL, identified by said eQTL
identification module, that colocalizes with a cQTL, identified by said cQTL
identification module, and (ii) a physical location of each gene in said plurality of genes
in a molecular map for said species, in order to determine the topology of a biological 
pathway that affects a complex trait. 

193. The computer program product of claim 192, wherein three or more
clinical traits are analyzed by said cQTL identification module.

194. The computer program product of claim 192, wherein eight or more
clinical traits are analyzed by said cQTL identification module.

195. The computer program product of claim 192, wherein three or more genes
are analyzed by said eQTL identification module.

196. The computer program product of claim 192, wherein eight or more genes
are analyzed by said eQTL identification module.

197. The computer program product of claim 192, wherein said molecular map
is a marker map and an eQTL is colocalized with a cQTL when the eQTL and the cQTL
fall within 1 cM of each other on said marker map.

198. The computer program product of claim 192, wherein said molecular map
is a genomic map and an eQTL is colocalized with a cQTL when they fall within 500
bases of each other on said genomic map.

199. The computer program product of claim 192, wherein said first QTL
analysis and said second QTL analysis each uses a marker map that represents the 
genome of said species. 

200. The computer program product of claim 199, wherein said marker map is 
constructed from single nucleotide polymorphisms (SNPs), microsatellite markers, 
restriction fragment length polymorphisms, short tandem repeats, DNA methylation 
markers, sequence length polymorphisms, random amplified polymorphic DNA,
amplified fragment length polymorphisms, or simple sequence repeats.

201. The computer program product of claim 192, wherein each said expression
value is a normalized expression level measurement for the gene in an organism in said 
plurality of organisms. 

202. The computer program product of claim 192, wherein said first QTL 
analysis comprises: 

(i) instructions for testing for linkage between (a) a genotype of said plurality of 
organisms at a position in a genome of said species and (b) said plurality of expression
statistics for the gene;

(ii) instructions for advancing the position in said genome by an amount; and 

(iii) instructions for repeating instructions (i) and instructions (ii) until a portion of 
the genome of said species has been tested.

203. The computer program product of claim 202, wherein said testing
comprises performing linkage analysis or association analysis. 

204. The computer program product of claim 203, wherein said linkage analysis 
or association analysis generates a statistical score for said position in the genome of said
species.

205. The computer program product of claim 204, wherein said testing is 
linkage analysis and said statistical score is a logarithm of the odds (lod) score. 

206. The computer program product of claim 205, wherein said eQTL is 
represented by a lod score that is greater than 3.0. 



207. The computer program product of claim 192, wherein said second QTL
analysis comprises:

(i) instructions for testing for linkage between (a) the genotype of said plurality of 
organisms at a position in a genome of said species and (b) said plurality of phenotypic 
values; 

(ii) instructions for advancing the position in said genome by an amount; and

(iii) instructions for repeating instructions (i) and instructions (ii) until a portion of
the genome of said species has been tested.

208. The computer program product of claim 207, wherein said testing 
comprises performing linkage analysis or association analysis. 

209. The computer program product of claim 208, wherein said linkage analysis 
or association analysis generates a statistical score for said position in the genome of said 
species. 

210. The computer program product of claim 209, wherein said testing is
linkage analysis and said statistical score is a logarithm of the odds (lod) score.

211. The computer program product of claim 210, wherein said cQTL is 
represented by a lod score that is greater than 3.0. 

212. The computer program product of claim 192, wherein said species is
human.

213. The computer program product of claim 192, wherein said complex trait is 
characterized by an allele that exhibits incomplete penetrance in said species.

214. The computer program product of claim 192, wherein said complex trait is 
a disease that is contracted by an organism in said population, and wherein said organism 
inherits no predisposing allele to said disease. 

215. The computer program product of claim 192, wherein said complex trait 
arises when any of a plurality of different genes in the genome of said species is mutated. 

216. The computer program product of claim 192, wherein said complex trait
requires the simultaneous presence of mutations in a plurality of genes in the genome of
said species.

217. The computer program product of claim 192, wherein said complex trait is 
associated with a high frequency of disease-causing alleles in said species. 

218. The computer program product of claim 192, wherein said complex trait is 
a phenotype that does not exhibit Mendelian recessive or dominant inheritance 
attributable to a single gene locus. 

219. The computer program product of claim 192, wherein said complex trait is
asthma, ataxia telangiectasia, bipolar disorder, cancer, common late-onset Alzheimer's
disease, diabetes, heart disease, hereditary early-onset Alzheimer's disease, hereditary
nonpolyposis colon cancer, hypertension, infection, maturity-onset diabetes of the young,
mellitus, migraine, nonalcoholic fatty liver, nonalcoholic steatohepatitis, non-insulin-
dependent diabetes mellitus, obesity, polycystic kidney disease, psoriases, schizophrenia,
or xeroderma pigmentosum.

220. A computer system for determining the topology of a biological pathway 
that affects a complex trait, the computer system comprising: 
a central processing unit;
a memory, coupled to the central processing unit, the memory storing an
expression quantitative trait loci (eQTL) identification module, a clinical quantitative trait 
loci (cQTL) identification module, and a determination module; wherein 
the eQTL identification module comprises: 
instructions for identifying one or more expression quantitative trait loci (eQTL)
for a gene in a plurality of genes using a first quantitative trait loci (QTL) analysis,
wherein said first QTL analysis uses a plurality of expression statistics for the gene as a 
quantitative trait, wherein each expression statistic in said plurality of expression statistics 
represents an expression value for the gene in an organism in a plurality of organisms of a 
species; and

instructions for repeating a first number of times said instructions for identifying 
one or more eQTL for a gene, wherein each repetition of said instructions for identifying 
one or more eQTL for a gene uses a different gene in said plurality of genes; 
the cQTL identification module comprises: 

instructions for identifying a clinical quantitative trait loci (cQTL) that is linked to
a clinical trait in a plurality of clinical traits using a second QTL analysis, wherein said
second QTL analysis uses a plurality of phenotypic values as a quantitative trait, wherein
each phenotypic value in said plurality of phenotypic values represents a phenotypic value
for the clinical trait in the plurality of clinical traits in an organism in said plurality of
organisms; and

instructions for repeating a second number of times said instructions for 
identifying one or more cQTL that is linked to a clinical trait, wherein each repetition of 
said instructions for identifying one or more cQTL that is linked to a clinical trait uses a 
different clinical trait in a plurality of clinical traits; and 

the determination module comprises:

instructions for using (i) the identity of each eQTL, identified by said eQTL 
identification module, that colocalizes with a cQTL, identified by said cQTL 
identification module, and (ii) a physical location of each gene in said plurality of genes 
in a molecular map for said species, in order to determine the topology of a biological
pathway that affects a complex trait.

221. The computer system of claim 220, wherein three or more clinical traits are
analyzed by said cQTL identification module. 



WO 2004/013727    PCT/US2003/023976

222. The computer system of claim 220, wherein eight or more clinical traits 
are analyzed by said cQTL identification module. 

223. The computer system of claim 220, wherein three or more genes are
analyzed by said eQTL identification module.

224. The computer system of claim 220, wherein eight or more genes are
analyzed by said eQTL identification module.

225. The computer system of claim 220, wherein said molecular map is a
marker map and an eQTL is colocalized with a cQTL when the eQTL and the cQTL fall
within 1 cM of each other on said marker map.

226. The computer system of claim 220, wherein said molecular map is a
genomic map and an eQTL is colocalized with a cQTL when they fall within 500 bases of
each other on said genomic map.

227. The computer system of claim 220, wherein said first QTL analysis and
said second QTL analysis each uses a marker map that represents the genome of said
species.

228. The computer system of claim 227, wherein said marker map is
constructed from single nucleotide polymorphisms (SNPs), microsatellite markers,
restriction fragment length polymorphisms, short tandem repeats, DNA methylation
markers, sequence length polymorphisms, random amplified polymorphic DNA,
amplified fragment length polymorphisms, or simple sequence repeats.

229. The computer system of claim 220, wherein each said expression
value is a normalized expression level measurement for the gene in an organism in said
plurality of organisms.

230. The computer system of claim 220, wherein said first QTL analysis
comprises:

(i) instructions for testing for linkage between (a) a genotype of said plurality of
organisms at a position in a genome of said species and (b) said plurality of expression 
statistics for the gene; 

(ii) instructions for advancing the position in said genome by an amount; and 

(iii) instructions for repeating instructions (i) and instructions (ii) until a portion of
the genome of said species has been tested.

231. The computer system of claim 230, wherein said testing comprises 
performing linkage analysis or association analysis. 

232. The computer system of claim 231, wherein said linkage analysis or
association analysis generates a statistical score for said position in the genome of said 
species. 

233. The computer system of claim 232, wherein said testing is linkage analysis
and said statistical score is a logarithm of the odds (lod) score.

234. The computer system of claim 233, wherein said eQTL is represented by a 
lod score that is greater than 3.0. 

235. The computer system of claim 220, wherein said second QTL analysis 
comprises: 

(i) instructions for testing for linkage between (a) the genotype of said plurality of 
organisms at a position in a genome of said species and (b) said plurality of phenotypic
values;

(ii) instructions for advancing the position in said genome by an amount; and 

(iii) instructions for repeating instructions (i) and instructions (ii) until a portion of
the genome of said species has been tested.

236. The computer system of claim 235, wherein said testing comprises
performing linkage analysis or association analysis.

237. The computer system of claim 236, wherein said linkage analysis or 
association analysis generates a statistical score for said position in the genome of said 
species.

238. The computer system of claim 237, wherein said testing is linkage analysis
and said statistical score is a logarithm of the odds (lod) score. 

239. The computer system of claim 238, wherein said cQTL is represented by a 
lod score that is greater than 3.0. 



240. The computer system of claim 220, wherein said species is human. 

241. The computer system of claim 220, wherein said complex trait is 
characterized by an allele that exhibits incomplete penetrance in said species. 



242. The computer system of claim 220, wherein said complex trait is a disease 
that is contracted by an organism in said population, and wherein said organism inherits
no predisposing allele to said disease.

243. The computer system of claim 220, wherein said complex trait arises when 
any of a plurality of different genes in the genome of said species is mutated.

244. The computer system of claim 220, wherein said complex trait requires the
simultaneous presence of mutations in a plurality of genes in the genome of said species.

245. The computer system of claim 220, wherein said complex trait is 
associated with a high frequency of disease-causing alleles in said species. 

246. The computer system of claim 220, wherein said complex trait is a 
phenotype that does not exhibit Mendelian recessive or dominant inheritance attributable 
to a single gene locus. 

247. The computer system of claim 220, wherein said complex trait is asthma,
ataxia telangiectasia, bipolar disorder, cancer, common late-onset Alzheimer's disease,
diabetes, heart disease, hereditary early-onset Alzheimer's disease, hereditary
nonpolyposis colon cancer, hypertension, infection, maturity-onset diabetes of the young,
mellitus, migraine, nonalcoholic fatty liver, nonalcoholic steatohepatitis, non-insulin-
dependent diabetes mellitus, obesity, polycystic kidney disease, psoriases, schizophrenia,
or xeroderma pigmentosum.



248. The method of claim 1, the method further comprising:

determining whether said eQTL for said gene G appears in a third quantitative
trait loci analysis, wherein said third QTL analysis uses a plurality of expression statistics
for gene G as a quantitative trait, wherein each expression statistic in said plurality of 
expression statistics represents an expression value for gene G in an organism in said 
plurality of organisms in which said gene has been knocked out. 

249. The method of claim 248 wherein said gene is knocked out using in vivo 
siRNA, a gene knock-out mouse strain, or a transgenic mouse strain. 

250. The computer program product of claim 54, the computer program 
mechanism further comprising: 

a validation module for determining whether said eQTL for said gene G appears 
in a third quantitative trait loci analysis, wherein said third QTL analysis uses a plurality 
of expression statistics for gene G as a quantitative trait, wherein each expression statistic 
in said plurality of expression statistics represents an expression value for gene G in an 
organism in said plurality of organisms in which said gene has been knocked out. 

251. The computer system of claim 107, the memory further comprising: 

a validation module that comprises instructions for determining whether said 
eQTL for said gene G appears in a third quantitative trait loci analysis, wherein said third 
QTL analysis uses a plurality of expression statistics for gene G as a quantitative trait, 
wherein each expression statistic in said plurality of expression statistics represents an 
expression value for gene G in an organism in said plurality of organisms in which said 
gene has been knocked out. 

252. The method of claim 1, the method further comprising: 

(D) validating said association between said gene G and said clinical trait T by 
testing for genetic linkage between said expression quantitative trait loci (eQTL) and said 
clinical quantitative trait loci (cQTL). 

253. The method of claim 252 wherein said testing for genetic linkage comprises 
marker-difference regression or a multiple-trait extension of composite interval mapping. 

254. The computer program product of claim 54, the computer program 
mechanism further comprising instructions for validating said association between said 
gene G and said clinical trait T by testing for genetic linkage between said expression 
quantitative trait loci (eQTL) and said clinical quantitative trait loci (cQTL). 

255. The computer program product of claim 254 wherein said testing for genetic 
linkage comprises marker-difference regression or a multiple-trait extension of composite 
interval mapping. 

256. The computer system of claim 107, the memory further 
comprising instructions for validating said association between said gene G and said 
clinical trait T by testing for genetic linkage between said expression quantitative trait loci 
(eQTL) and said clinical quantitative trait loci (cQTL). 

257. The computer system of claim 256 wherein said testing for genetic linkage 
comprises marker-difference regression or a multiple-trait extension of composite interval 
mapping. 

258. The method of claim 5 wherein said test for pleiotropy comprises comparing 
a null hypothesis, indicating that said eQTL and said cQTL are represented by a QTL that 
is common to both said eQTL and said cQTL, to an alternative hypothesis, indicating 
linkage disequilibrium. 

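259. The method of claim 258 wherein said null hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 \\ \beta_2 \end{pmatrix} Q +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

Q is a categorical random variable indicating the genotypes at the position of said 
eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; and 

$\mu_i$ and $\beta_i$ are model parameters. 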



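260. The method of claim 258 wherein the alternative hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 & 0 \\ 0 & \beta_2 \end{pmatrix}
\begin{pmatrix} Q_1 \\ Q_2 \end{pmatrix} +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

$Q_1$ and $Q_2$ are categorical random variables indicating the genotypes at the 
positions of said eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; and 

$\mu_i$ and $\beta_i$ are model parameters. 
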
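261. The method of claim 258 wherein the alternative hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 & \beta_2 \\ \beta_3 & \beta_4 \end{pmatrix}
\begin{pmatrix} Q_1 \\ Q_2 \end{pmatrix} +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

$Q_1$ and $Q_2$ are categorical random variables indicating the genotypes at the 
positions of said eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; 

$\mu_i$ and $\beta_i$ are model parameters; and 
one of conditions (i) through (iv) is valid: 

(i) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 = 0$, and $\beta_3 = 0$; 

(ii) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 \neq 0$, and $\beta_3 = 0$; 

(iii) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 = 0$, and $\beta_3 \neq 0$; and 

(iv) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 \neq 0$, and $\beta_3 \neq 0$. 
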
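262. The method of claim 258 wherein the negative 
loglikelihood for the null hypothesis and the alternative hypothesis are minimized using 
maximum likelihood analysis thereby forming a likelihood ratio test statistic. 
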
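263. The computer program product of claim 58 wherein said test for pleiotropy 
comprises comparing a null hypothesis, indicating that said eQTL and said cQTL are 
represented by a QTL that is common to both said eQTL and said cQTL, to an alternative 
hypothesis, indicating linkage disequilibrium. 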



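264. The computer program product of claim 263 wherein said null hypothesis 
is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 \\ \beta_2 \end{pmatrix} Q +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

Q is a categorical random variable indicating the genotypes at the position of said 
eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; and 

$\mu_i$ and $\beta_i$ are model parameters. 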

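265. The computer program product of claim 263 wherein the alternative 
hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 & 0 \\ 0 & \beta_2 \end{pmatrix}
\begin{pmatrix} Q_1 \\ Q_2 \end{pmatrix} +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

$Q_1$ and $Q_2$ are categorical random variables indicating the genotypes at the 
positions of said eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; and 

$\mu_i$ and $\beta_i$ are model parameters. 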



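266. The computer program product of claim 263 wherein the alternative 
hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 & \beta_2 \\ \beta_3 & \beta_4 \end{pmatrix}
\begin{pmatrix} Q_1 \\ Q_2 \end{pmatrix} +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

$Q_1$ and $Q_2$ are categorical random variables indicating the genotypes at the 
positions of said eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; 

$\mu_i$ and $\beta_i$ are model parameters; and 
one of conditions (i) through (iv) is valid: 

(i) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 = 0$, and $\beta_3 = 0$; 

(ii) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 \neq 0$, and $\beta_3 = 0$; 

(iii) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 = 0$, and $\beta_3 \neq 0$; and 

(iv) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 \neq 0$, and $\beta_3 \neq 0$. 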

267. The computer program product of claim 263 wherein the negative 
loglikelihood for the null hypothesis and the alternative hypothesis are minimized using 
maximum likelihood analysis thereby forming a likelihood ratio test statistic. 
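
By way of illustration only, the formation of a likelihood ratio test statistic from two minimized negative loglikelihoods can be sketched as follows. The sketch rests on several assumptions that are not recited in the claims: genotypes are coded numerically (for example 0, 1, 2), the alternative hypothesis leaves all four coefficients free (condition (iv)), and the error covariance is left unstructured, so that ordinary least squares gives the maximum likelihood estimates. The function names `gaussian_negloglik` and `pleiotropy_lrt` are hypothetical.

```python
import numpy as np


def gaussian_negloglik(Y: np.ndarray, X: np.ndarray) -> float:
    """Minimized negative loglikelihood of Y = X B + E, rows of E ~ N(0, Sigma).

    With the same regressors in every equation and an unrestricted Sigma,
    least squares gives the maximum likelihood estimate of B, and the
    estimate of Sigma is the residual cross-product divided by n.
    """
    n, k = Y.shape
    B, *_ = np.linalg.lstsq(X, Y, rcond=None)
    resid = Y - X @ B
    sigma_hat = resid.T @ resid / n
    _, logdet = np.linalg.slogdet(sigma_hat)
    return 0.5 * n * (k * np.log(2.0 * np.pi) + logdet + k)


def pleiotropy_lrt(y1, y2, q, q1, q2) -> float:
    """Likelihood ratio statistic: 2 * (negloglik_null - negloglik_alt).

    Null: both traits depend on one genotype vector q.
    Alternative: both traits may depend on q1 and q2 (all coefficients free).
    """
    Y = np.column_stack([y1, y2])
    ones = np.ones(len(y1))
    nll_null = gaussian_negloglik(Y, np.column_stack([ones, q]))
    nll_alt = gaussian_negloglik(Y, np.column_stack([ones, q1, q2]))
    return 2.0 * (nll_null - nll_alt)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 200
    q1 = rng.integers(0, 3, n).astype(float)  # simulated genotypes, locus 1
    q2 = rng.integers(0, 3, n).astype(float)  # simulated genotypes, locus 2
    y1 = 1.0 + q1 + rng.normal(size=n)        # expression trait
    y2 = 2.0 + q2 + rng.normal(size=n)        # clinical trait, separate locus
    print(pleiotropy_lrt(y1, y2, q1, q1, q2))
```

Because the null model here is nested in the alternative when q equals q1, the statistic is non-negative; larger values favor two distinct loci over a single shared QTL. The restricted alternatives (conditions (i) through (iii)) would require a constrained fit rather than plain least squares.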



268. The computer system of claim 111 wherein said test for pleiotropy 
comprises comparing a null hypothesis, indicating that said eQTL and said cQTL are 
represented by a QTL that is common to both said eQTL and said cQTL, to an alternative 
hypothesis, indicating linkage disequilibrium. 

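269. The computer program product of claim 268 wherein said null hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 \\ \beta_2 \end{pmatrix} Q +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

Q is a categorical random variable indicating the genotypes at the position of said 
eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; and 

$\mu_i$ and $\beta_i$ are model parameters. 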

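270. The computer program product of claim 268 wherein the alternative 
hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 & 0 \\ 0 & \beta_2 \end{pmatrix}
\begin{pmatrix} Q_1 \\ Q_2 \end{pmatrix} +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

$Q_1$ and $Q_2$ are categorical random variables indicating the genotypes at the 
positions of said eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; and 

$\mu_i$ and $\beta_i$ are model parameters. 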

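271. The computer program product of claim 268 wherein the alternative 
hypothesis is: 

\[
\begin{pmatrix} y_1 \\ y_2 \end{pmatrix} =
\begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix} +
\begin{pmatrix} \beta_1 & \beta_2 \\ \beta_3 & \beta_4 \end{pmatrix}
\begin{pmatrix} Q_1 \\ Q_2 \end{pmatrix} +
\begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix}
\]

wherein 

$Q_1$ and $Q_2$ are categorical random variables indicating the genotypes at the 
positions of said eQTL and said cQTL in said plurality of organisms; 

$(\varepsilon_1, \varepsilon_2)^T$ is distributed as a bivariate normal random variable with mean 
$(0, 0)^T$ and covariance matrix 

\[
\begin{pmatrix} \sigma_1^2 & \sigma_1\sigma_2 \\ \sigma_1\sigma_2 & \sigma_2^2 \end{pmatrix}
\]

; 

$\mu_i$ and $\beta_i$ are model parameters; and 
one of conditions (i) through (iv) is valid: 

(i) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 = 0$, and $\beta_3 = 0$; 

(ii) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 \neq 0$, and $\beta_3 = 0$; 

(iii) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 = 0$, and $\beta_3 \neq 0$; and 

(iv) $\beta_1 \neq 0$, $\beta_4 \neq 0$, $\beta_2 \neq 0$, and $\beta_3 \neq 0$. 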

272. The computer program product of claim 268 wherein the negative 
loglikelihood for the null hypothesis and the alternative hypothesis are minimized using 
maximum likelihood analysis thereby forming a likelihood ratio test statistic. 